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We study the problem of constructing universal Steiner trees for undirected graphs. Given a graph 
G and a root node r, we seek a single spanning tree T of minimum stretch, where the stretch of T is 
defined to be the maximum ratio, over all terminal sets X, of the cost of the minimal sub-tree Tx of T 
that connects X to r to the cost of an optimal Steiner tree connecting X to r in G. Universal Steiner 
trees (USTs) are important for data aggregation problems where computing the Steiner tree from scratch 
for every input instance of terminals is costly, as for example in low energy sensor network applications. 



We provide a polynomial time UST construction for general graphs with 2'-'(^'°s")-stretch. We also 
give a polynomial time polylogarithmic-stretch construction for minor-free graphs. One basic building 
block of our algorithms is a hierarchy of graph partitions, each of which guarantees small strong diameter 
for each cluster and bounded neighbourhood intersections for each node. We show close connections 
between the problems of constructing USTs and building such graph partitions. Our construction of 
partition hierarchies for general graphs is based on an iterative cluster merging procedure, while the 
one for minor-free graphs is based on a separator theorem for such graphs and the solution to a cluster 
aggregation problem that may be of independent interest even for general graphs. To our knowledge, 
this is the first subpolynomial-stretch (o(ri') for any e > 0) UST construction for general graphs, and the 
first polylogarithmic-stretch UST construction for minor-free graphs. 
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Abstract 



1 Introduction 



In this paper, we study universal approximations for the Steiner Tree problem on undirected graphs. In the 
universal Steiner Tree (UST) problem for graphs, we are given an undirected graph G and a designated root 
vertex r of G, and the task is to find a single spanning tree T of G such that for any set X of terminal 
vertices, the minimal subtree Tx of T that connects X to r is a good approximation to the optimal Steiner 
tree connecting X to r in G. The quality of the solution T is measured by its stretch which is the maximum 
ratio, over all terminal sets X, of the cost of Tx to the cost of the optimal Steiner tree connecting X to r in 
G. 

The universal Steiner tree problem has been studied extensively for the case of metrics where one is 
allowed to output an "overlay tree", whose edges correspond to paths in the given graph [2T1 [191 l9l l30l. 
Equivalently, the case of metrics can be viewed as a complete graph in which all edge weights satisfy the 
triangle inequality. In fact, for the case of metrics, there have been several important results on extensions 
of the UST problem and variants seeking sparse network structures that simultaneously approximates the 
optimal solution for a range of input instances |[T5l[T7l[T6l[T9l . 

The focus of this paper is on the UST problem on arbitrary graphs where we require that the solution 
being sought is a spanning tree, i.e., a subgraph of the given graph. The Minimum Steiner tree problem on a 
graph can be well-approximated by solving the same problem on the metric induced by the graph and then 
computing the minimum subtree connecting the terminals. Such an approach, however, does not apply to 
the UST problem owing to the requirement that the tree simultaneously approximate the optimal Steiner tree 
for all terminal sets. Note that this is a much stronger requirement than asking for a probability distribution 
over spanning trees where for every terminal set, the expected stretch is small. In the latter case, there might 
not be any single tree in the distribution that is good for all terminal sets, i.e., for every tree there is a set of 
terminals for which the induced tree has a cost much larger than the optimal steiner tree. 

Motivation. Our problem formulation is primarily motivated by information aggregation and data dissemi- 
nation in sensor and ad-hoc wireless networks |[25ll26ll24ll . In a sensor network, data is often collected by a 
central processing agent that periodically queries a subset of sensors for their sensed information. In many 
applications, the queries seek aggregate information which can be transmitted using a low cost tree that 
aggregates data at intermediate nodes. This reduces the number of transmissions which is crucial as sensors 
have limited battery life and wireless transmissions are power intensive. It is not realistic, however, to expect 
the sensors to compute and store a low cost tree for each potential subset of sensors being aggregated as the 
sensors have limited memory and computational power. In this setting, a universal tree provides a practical 
solution where the nodes just need to realize a single tree which approximates optimal aggregation trees for 
all subsets of sensors. One approach for the above aggregation problem is to employ a universal overlay 
tree. There are several disadvantages of this approach, however. First, aggregation over the overlay tree 
requires a physical routing infrastructure that supports point-to-point communication among distant nodes 
in the network. Second, even if such an infrastructure exists, it may not route packets along minimum-cost 
paths as required by the overlay tree. Furthermore, aggregation over the overlay tree requires synchroniza- 
tion among distant nodes in the network and incurs overhead in terms of delays and storage. Thus, in some 
resource-constrained applications, we would ideally want to construct a universal spanning tree as opposed 
to an overlay tree. 

Another motivation to study universal approximation algorithms comes from their relation with differ- 
ential privacy which was recently established by Bhalgat, Chakrabarty and Khanna [9|. They showed that 
universal solutions such as USTs are differentially private, and argued that a kind of "strong" lower bounds 
for universal algorithms impUes lower bounds for differentially private ones as well. 

From a theoretical standpoint, our motivation is to find out whether the results known for UST and re- 
lated problems in the metric case can, in fact, be achieved using spanning trees of the underlying graph. 
The analogous question for the problem of approximating metrics by tree metrics has been answered affir- 
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matively by ifTSl [B who showed that nearly logarithmic-stretch spanning trees exist for all graphs, almost 
matching the best bound achievable by tree metrics [14J. No comparable results are known for the UST 
problem. 

1.1 Our results and techniques 

We present UST algorithms for general graphs and for the special class of minor-free graphs. Our main 
results are the following. 

• UST for general graphs: We present a polynomial-time algorithm for computing an 2'^(^^°s")- 
stretch spanning tree for any undirected graph. 

• UST for minor-free graphs: We present a polynomial-time algorithm for computing a poly logarithmic- 
stretch spanning tree for any graph that is if-minor free for any finite graph H. 

While the specific techniques used in the two algorithms are substantially different, both are grounded in a 
general framework that draws close connections between USTs and certain graph partitions based on strong 
diameter. We define an (a, /3, 7)-partition of a graph G as a partition of the vertices of G into clusters such 
that each cluster has strong diameter at most 07 and for every vertex the ball of radius 7 in G intersects at 
most /3 clusters. An (a, /3, 7)-partition hierarchy is a sequence of partitions starting from the trivial partition 
in which each vertex forms its own cluster, and the ith partition is an (a, /3, 7*) -partition and coarsens the 
{i — l)th partition. (See Section|2]for formal definitions.) The significance of our framework stems from the 
following result. 

• From partition hierarchies to USTs: For any graph G, given an (q, /3, 7) -partition hierarchy for G, 
an 0(a^/3^7log^ n)-stretch UST for G can be constructed in polynomial time (Section [3711 ). 

A major consequence of the above result is that a (polylog(n),polylog(n),polylog(n))-partition hierarchy 
implies a polylog(n)-stretch UST. At a high-level, our approach of using partition hierarchies to derive 
USTs is similar to that of II2TI . There is a critical difference, however, since the natural divide-and-conquer 
approach of constructing the UST by connecting together subtrees recursively computed for lower levels of 
the hierarchy does not work. In fact, it can be shown that there exist graphs and hierarchies such that any tree 
that completely obeys the connectivity structure of the hierarchy in the sense that the subgraph of the tree 
induced by every cluster of the hierarchy is connected will have poor stretch. We show, however, that we 
can get the desired bound on stretch by guaranteeing that for any cluster in the given hierarchy, even though 
the tree may be split within the cluster it is joined externally so as to approximately respect the distances 
within the cluster. 

A natural question to ask is whether (a, /3, 7)-partitions or the corresponding hierarchies, with low 
values of a, /3, and 7, are necessary to achieve low-stretch USTs. We provide a partial affirmative answer to 
this question with the following result. 

• From USTs to partitions: If every graph has a (5-stretch UST, then for any real 7 > 0, every graph 
has an (0((5^), 0(5), 7)-partition (Section ll!2l). 

We next obtain our main results for general graphs and minor-free graphs by constructing suitable partition 
hierarchies. 

• Partition hierarchies for general graphs: Every graph G has a polynomial-time computable 
(20(v^)^ 2'^(v^), 2'^(v^)).partition hierarchy (Section|l. 

• Partition hierarchies for minor-free graphs: Every minor-free graph G has a polynomial-time 
computable (0(log^ n), 0(log^ n), 0(log'^ n))-partition hierarchy (Section|6ll. 
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The partition hierarchy for general graphs is obtained by an iterative procedure in which clusters are contin- 
ually merged by identifying vertices for which the number of intersecting clusters within a specified distance 
exceeds the desired bound. The particular order in which the vertices are processed is carefully chosen; a 
natural greedy approach fails. 

Our construction of the partition hierarchy for minor-free graphs is much more complicated. It is based 
on a separator theorem due to lISTl 121 which builds on the influential work of [22] and shows that any 
minor-free graph can be decomposed into connected components, each of which contains at most half the 
number of nodes, by removing a sequence of a constant number of shortest paths. A key ingredient of our 
hierarchical construction for minor-free graphs is a result on cluster aggregation in general graphs, which is 
of independent interest. 

• Cluster aggregation: We show that given any partition of G into disjoint clusters each with strong 
diameter at most D, and a set S of portal vertices, we can aggregate the clusters into disjoint connected 
components, each component with a distinguished portal from S, such that for any vertex v, the 
distance, within the component of v, from v to the distinguished portal in the component is at most 
0(log2 n)D more than the distance of v to 5 in G (Section |5]l. 

1.2 Related work 

Research in network design over the past decade has revealed that it is often possible to derive sparse network 
structures (e.g., routes, multicast trees) that yield good approximations simultaneously for a range of input 
instances. One of the earliest examples of such a result is due to Goel and Estrin |15| who introduced the 
problem of simultaneous single sink buy-at-bulk and gave an 0{logD) bound on the simultaneous ratio 
where D is the total demand. The simultaneous guarantee means that their solution works simultaneously 
for all fusion cost function / which are concave and monotonically non-deceasing with /(O) = 0. In a 
related paper 1 16], Goel and Post constructed a distribution over trees such that the expected cost of a tree 
for any / is within an 0(l)-factor of the optimum cost for that /. A recent improvement by Goel and 
Post lITTl provides the first constant guarantee on the simultaneous ratio achievable by a tiee. This result is 
incomparable to our results since the set of terminals that are being aggregated in the buy-at-bulk problem 
are fixed. 

Jia et al. pTl introduced the notion of universal approximation algorithms for optimization problems 
and provided approximation algorithms for TSP, Steiner Tree and set cover problems. For the universal 
Steiner tree problem, they presented polynomial-time algorithms that construct overlay trees with a stretch 
of 0(log^ n/ log log(n)) for arbitrary metrics and logarithmic stretch for doubling, Euclidean, or growth- 
restricted metrics. They also provided a lower bound of (7 (log n/ log log n) for UST that holds even when 
all the vertices are on a plane; for general metrics, this can be improved to ri(log n) 1191 l9l. Note that these 
lower bounds extend to the UST problem on graphs. Lower bounds for universal TSP are given in 1201 [TSl . 
For earlier work on universal TSP, see [i28n 81. 

Gupta, Hajiaghayi and Racke |fT9l developed an elegant framework to model oblivious network design 
problems and gave algorithms with poly-logarithmic approximation ratios. They give network structures 
that are simultaneously oblivious to both the link cost functions (subject to them being drawn from a suit- 
able class) and traffic demand. Their algorithms are based on the celebrated tree metric embeddings of 
Fakcharoenphol et al. |, 14] and hierarchical cut-based decompostions of Racke [29]. For the UST problem 
on metrics, the algorithm of |[T9l builds a UST as follows: First obtain 0(log n) trees from the distribution 
of HH; next assign each non-root vertex to a tree that well-approximates its distances to all other nodes; 
finally, take the union, over each of the O(logn) overlay trees, the subtree of the tree induced by the root 
and the vertices assigned to the tree. The resulting overlay tree is an 0(log^ n)-stretch UST. 

A potential approach to solving the UST problem on graphs is to adapt the techniques of |[T9l with 
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O(logn) spanning trees drawn from the distributions of [] instead of the overlay trees of lfT4l . A major 
challenge here is that the paths or subtrees chosen from the different O(logn) may share several vertices 
and hence create unavoidable cycles when combined. The only prior work on constructing universal Steiner 
trees for graphs is due to Busch et al. |[30l who achieved a stretch of O(log'^n) for the restricted class of 
graphs with bounded doubling dimension by showing how one can continually refine an O (log n) -stretch 
overlay tree by removing cycles to obtain an 0(log^ n)-stretch UST. Their techniques, however, are closely 
tied to the particular class of graphs and seem difficult to generalize. 

The aforementioned problem of approximating a graph metric by a tree metric has a rich history. Alon 
et al. 101 showed an upper bound of 

2Vlognloglogn 

for approximating an arbitrary graph metric by a distribu- 
tion over spanning trees. Bartal (6] showed that an improved 0(log^ n) approximation is achievable using 
tree metrics if one drops the requirement that the trees be subgraphs of the underlying graph. Konjevod et al. 
ll23l improved Hartal's result to 0(log n) for planar graphs while Charikar et al. [12] improved it for low di- 
mensional normed spaces. Subsequently, Bartal Q improved his earlier result to 0(log n log log n) and also 
showed a lower bound of il(log n) on the distortion for probabilistically embedding an expander graph into a 
tree. Fakcharoenphol, Rao and Talwar [14] closed the gap between the lower and the upper bound by show- 
ing that arbitrary metrics can be approximated by a distribution over tree metrics with distortion O(logn). 
More recently, EMn, Emek, Spielman, and Teng |[T3l showed an upper bound of 0(log^ n log log n) for 
approximating an arbitrary graph metric using a distribution of spanning trees, thus significantly improving 
the result of Alon et al [4]. This result was subsequently improved by Abraham, Bartal, and Neiman [1] 
who achieved an 0(log n log log n(log log log n)^) bound. 

As mentioned in Section [LT] our universal Steiner trees are based on certain partitions of graphs where 
we would like to bound the strong diameter of the clusters while maintaining some sparsity constraints. Such 
partitions have been extensively studied ['271, '51. While nearly optimal partitions based on weak diameter 
bounds are known in many cases, strong-diameter based decompositions are less understood [27]. There 
have been recent results on strong-diameter decompositions |[T] |3l; while our partitions share some of the 
ideas (e.g., of stitching together judiciously chosen shortest paths), there are significant differences in the 
details and the particular partitions being sought. Furthermore, while we seek partition hierarchies with 
deterministic guarantees, these previous results concerned hierarchies with either probabilistic guarantees 
or covers where clusters are allowed to overlap. 

2 Definitions and notation 

Let G = {V, E, w) denote a weighted undirected graph, where V and E are the sets of vertices and edges, 
respectively, and w : ii^ — M is a weight function on edges. For any u and v in V, let the distance between 
u and V, denoted by d{u, v), be the weight of a shortest path between u and v, according to w. For v ^ V 
and real number p, let B{v, p) denote the ball of radius p centered at v, i.e., B{v, p) is the set of all vertices 
that are at distance at most p from v, including v. For any graph G, let diameter Diam(G) denote the 
maximum distance between any two vertices of G. For any graph G and any subset X of vertices in G, let 
H[X] denote the subgraph of G induced by X. For any subset X of vertices and vertices u and v in X, let 
dx{u, v) denote the distance between u and v in 

Universal Steiner trees. We now introduce some notation that helps formalize the universal Steiner tree 
problem. Given a specified root vertex r £ V and a set of terminal vertices X C y, a Steiner tree T for 
X is a minimal subgraph of G that connects the vertices of X to the root. The cost of tree T, denoted by 
COST(r), is the sum of the weights of edges in it. Assume G and r to be fixed. We let Opt(X) denote the 
cost of the minimum weight steiner tree connecting X to r. Given a spanning tree T of G and terminal set 
X, we define its projection on the terminal set X, denoted by Tx, as the minimal subtree of T rooted at r 
that contains X. 
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Definition 1 (Universal Steiner tree). Let G be an undirected weighted graph, and r be a specified root 
vertex in V. We define the stretch of a spanning tree T of G to be maxxcy '^Opt(x) universal 
Steiner tree problem is to find a spanning tree with minimum stretch. 

Note that the requirement that the root be included in the set of terminals is indispensable, otherwise we 
cannot have a tree with better than 0(n) stretch which can be easily seen from the example of a ring. 
Partitions. A partition "P of y is a collection of disjoint subsets of V whose union equals V . We refer to 
any subset of vertices, and hence each element of V, as a cluster of the graph G. There are two notions for 
the diameter of a cluster. This paper focuses on the strong diameter, which is the diameter of the subgraph 
induced by the cluster. In contrast, the weak diameter of a cluster is simply the maximum distance between 
any two verices of the cluster in G. 

Definition 2 ((a, /3,7)-partition). For any real j > 0, an (a, P,j)-partition V of G is a partition ofV 
satisfying the following properties. 

1. Strong diameter: The strong diameter of every cluster G in V is at most 07; i.e., Diam(G[C] ) < aj. 

2. Cluster- valence: For every vertex v in V, B(v, 7) has a nonempty intersection with at most (3 clusters 
in V. We refer to /3 as the cluster-valence ofV. 

A notion of partition similar to our (a, /?, 7)-paitition appeared in Jia et al. II2TI . which required a bound 
on the weak diameter of clusters. 

Definition 3 (Partition hierarchy). For a given real 7 > 1, a (a, /3, 'y)-partition hierarchy of a graph G is 

a sequence < Vq,Vi, . . . ,Vd > of partitions ofV, where d = log^( ^^^^^'^^ ), satisfying the following 
properties. 

1. Partition: For < i < d, Vi is an (a, f3, -partition of G. Furthermore, Vd is the collection {V}. 
For convenience, we set V-i to the set {{v} \ v G V}. 

2. Hierarchy: For < i < d, every cluster in Vi is a subset of some cluster in Vi+i. 

3. Root Padding: For < i < d, the ball i?(r, 7*) of radius 7* around root r is contained in some 
cluster ofVi. 

For a partition V, let 'P(w) denote the cluster of V that contains v and MaxDiam('P) denote niay^ceV Diam(C). 
For a subset X of vertices, let V [X] denote the partition restricted to X; i.e., V [X] equals {X n C | C G V}. 
For a partition hierarchy H and a cluster G that is an element of a partition ViinT-L, we let 'H[C] denote the 
partition hierarchy projected to G ; that is, 

=<Vo[G\,...,Vi[G\ > . 



3 Strong partitions and Universal Steiner trees 

In this section, we present close connections between the strong partitions of Definition [2] and universal 
Steiner trees. We first show in Section [3?T] that for any graph G a strong (a, /3, 7)-partition hierarchy yields 
an (a^/3^7log^ n)-stretch spanning tree for G. We next show, in a somewhat weaker sense, that partitions 
with low strong diameter and low cluster- valence are necessary for deriving low-stretch trees. In particular, 
Section [J!2l shows that if every graph has a cj-stretch spanning tree, then every graph has a (cr, cr^ , 7)-partition 
for all 7 > 0. 
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3.1 From a partition hierarchy to a universal Steiner tree 



The main result of this section is an algorithm to construct an 0(a^/3^7log^ n)-stretch UST from an 
(a, /3, 7)-partition hierarchy. We say that a spanning tree T of G fi-respects a (a, /?, 7)-partition hierar- 
chy {Vi} if for any i, any cluster C of Vi, and any vertices u,v £ C, dxiu, v) is at most fiaj^. 

Lemma 4. Any spanning tree T that fi-respects an (a, f3, ^-partition hierarchy has a stretch ofO{fj,af3'j log^ n). 

Proof. Let {Vi) denote the given (a, /3, 7)-partition hierarchy. Fix a set X of vertices, that contains the root 
r and is of size at least two. For each cluster C in the strong partition hierarchy such that CnX is nonempty, 
let ^(C) denote an arbitrary vertex inC Ci X. 

Let Tx denote the subtree of T{G) induced by X. We place an upper bound on the cost of Tx as 
follows. Let Ui denote the number of clusters in that X intersects. For convenience, we define n_i to be 
\X\ and V-i to be the trivial clustering consisting of a singleton set for each vertex. Let j be the smallest 
integer such that X is a subset of the cluster in Vj that contains r. Thus nj equals 1 and rij^i > 1 for all 
< i < j. Fix an i, < i < j. Let C be any cluster of Vi that intersects X, and let C denote the cluster of 
Vi+i that contains C. Since T ^u-respects the partition hierarchy, it follows that the length of the path from 
i{C) to i{C') in T is at most fiaj^^^. Therefore, the cost of Tx is at most X]o<j<j riifia^'^^. 

We next place a lower bound on OPT{X). Fix an i, < i < j. By the cluster-valence property 
of the hierarchy, any ball of radius 7* intersects at most (3 clusters in Vi. Thus, there are at least \ni/ jS'] 
vertices in X that are at pairwise distance at least 7* from one another. This implies that OPT{X) is at 
least ( [nj//?] — 1)7*. If [nj//?] = 1, we invoke the padding property which says there is at least one vertex 
in X that is at distance 7j from the root, implying a lower bound of 7* on OPT{X). Combining the two 
bounds, we obtain a lower bound of U,{niY / f3). For i = —1, we also have a lower bound of n_i (which in 
fact supersedes the lower bound for i = 0). Thus, the stretch of T{G) is at most 



By a more careful analysis using standard techniques, we can replace the log Diam(G) factor by log n; we 



Appendix lAl presents a bottom-up divide-and-conquer algorithm for constructing a spanning tree T that 
completely obeys the partition hierarchy in the sense that for any cluster C of the hierarchy, T[C] is, in fact, 
a tree; that is, T completely respects the connectivity of each cluster of the hierarchy. In doing so, however, 
it pays a huge cost in the distances within the cluster. And, in fact, this is unavoidable; one can construct 
examples of strong partition hierarchies where obeying the connectivity of each cluster in the tree will result 
in a significant blow-up of stretch. 

We now present a much more careful construction of a universal Steiner tree which does not enforce 
the connectivity constraint within clusters; that is, we use a given partition hierarchy T-L to build a tree T in 
which T[C] may be disconnected. By allowing this disconnectivity within clusters, however, we show that 
we can build a tree that //-respects the given hierarchy for a much smaller ^, assuming 7 is sufficiently large. 
The pseudocode is given in Algorithm [U We have presented the algorithm in a more general context where 
the goal is to compute a forest rooted at a given set of portals. To obtain a UST, we invoke the algorithm 
with the portal set being the singleton set consisting of the root. 

Before presenting our algorithm, we introduce some useful notation. For a partition V of a graph H, let 
H[V] denote a graph in which the vertex set is V, and there is an edge (C, C) from cluster C to C" if H 
has an edge between a vertex in C and a vertex in C"; the weight of the edge {C,C') is the weight of the 
minimum-weight edge between C and C in H. 




defer the details to the full paper. 



□ 
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Algorithm 1 UST: The split and join algorithm 



Require: Undirected graph G = iy,E), a set Sq C F of portals, a partition hierarchy % = 

Ensure: A forest F that coimects every vertex in F to 5. 
1: For an edge e = (Ci, C2) in G\Pg\, let m(e) denote the minimum-weight edge from C\ to C2 in G. 
Let 5 denote the set of clusters that have a nonempty intersection with S. 
for cluster C in S* do 

Set Sc to be C n S. 
end for 

Compute a shortest path forest F in G[Pe\ rooted at S. 
for cluster C in Ve. in order of decreasing distance from 5 in F do 
if C is a leaf node in F then 

Set RANK(C) to be 0, Sc to be {tail of m(e)} where e is the edge connecting C to its parent in F. 
else 

Let MAXC be max{RANK(C") | C' is child of C}. Set fav(C) to be a child of C with rank maxc. 
Set highway(C) to be a shortest path from the head of m(e) to the tail of m(e') where e and e' 
are the edges coimecting fav(C) to C and C to its parent, respectively, in F. Set Sc to be the set 
of nodes in HIGHWAY (C). 

if there exist at least two children of C in F whose rank equals MAXC then 

Set rank(C) to be maxc + 1 
else 

Set rank(C) to be maxc 
end if 
end if 
end for 

for each cluster C in do 

Compute F{C) = UST(G[C], {Sc U Sc) n C,U[C]) 
end for 

Return F to be the union of UceP^ highway (C) and UceP^ -^(^) 



Lemma 5. Let F be the final forest returned by the algorithm. For any cluster C, when UST is called on 
cluster C, either Sc is a subset of S or for any two vertices u and v in Sc, dpiu, v) is at most dc{u, v). 

Proof We first prove that for any cluster C, the set Sc is exactly one of the following: (i) S, if C is G; or 
(ii) a subset of Sc for the parent cluster C' of C; or (iii) a subset of nodes on a highway (C) constructed 
when processing parent cluster C'. The proof is by induction on the level of the hierarchy. The base case is 
trivial for C being the whole graph. For the induction step, consider level i < ^ of the hierarchy, and let C 
be a cluster in T'j. 

We consider two cases. In the first case, C intersects Sc where C' is the parent cluster for C. In this 
case, Sc is set to the intersection of C and Sc' as desired. In the second case, C is disjoint from Sc ■ In 
this case, Sc is simply the set of vertices in highway (C), again completing the induction step. 

To complete the proof of the lemma, consider a cluster C in Vi, for some i. If S'c is a subset of S, the 
lemma trivially follows. If Sc is not a subset of S, then by the above claim, Sc is a subset of nodes on a 
highway h in C. By our construction, /i is a shortest path in C. Since h is part of F, it follows that for any 
two vertices u and v in Sc, dF{u, v) equals dc{u, v). □ 

Lemma 6. The rank of any cluster C in partition Ve is at most logdT'^l). 
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Proof. Let F denote the shortest path forest in G[Pt\- We show that for any cluster C, the rank of C is at 
most log (mc), where mc is the number of nodes in the subtree of F rooted at C. 

The proof is by induction on the height of C. The induction basis is immediate for the leaves of F. 
We now consider the induction step. For cluster C, let r denote the rank of the child with highest rank 
among all children of C. Let S denote the set of children of C that have rank r. We note that mc is at least 
^ + Y1,C'&S ^-c- Furthermore, by the induction hypothesis, mc is at least 2'', for each C" in S. We consider 
two cases. If IS*! is 1, then the rank of C equals r, which is at most log(mc) by the induction hypothesis. In 
I SI > 2, then the rank of C equals r + 1; since mc is at least 1 + 2 • 2*" > 2^'"*"^, the induction step follows, 
completing the proof. □ 

Lemma 7. Let F be the final forest returned by the algorithm. If^> logn, then for any cluster C in Vi 
and vertex u in C, dp{u, Sc) is at most 3a'^l3Y- 

Proof. We prove by induction on level i that dp{u, Sc) is at most 3a'^f3^\ with the base case being i = 0. 
In this case, the cluster and its portal set are the same singleton vertex set, trivially yielding the desired 
claim. For the induction step, we consider i > 0. Let C be a cluster of Vi. For any vertex u in C, let Cu 
denote Vi-i{u), that is, the cluster in partition Vi-i that contains u. 

As in the algorithm, let S denote the set of clusters in the partition of C that intersect 5c- Let Cu = 
CqjCi, . . . ,Ck, where G S, denote the sequence of clusters in the unique path from C„ to S in the 
supergraph. Note that C,; is the parent of Cj_i in the supergraph. By our earlier argument, we know that k 
is at most a^j. We now argue that there are at most log n elements Cj in the sequence such that Cj is not 
FAv(Cj+i). To see this, we note that if Ci is not FAv(Ci+i), then RANK(Cj+i) strictly exceeds RANK(Cj). 
Since the rank of any cluster is at most log n by Lemma[6l the desired claim holds. 

This sequence of clusters induces a path from n to ^c, which consists of (a) the connecting edges in the 
supergraph, (b) the highway in each cluster Cj in the sequence, (c) for each cluster Cj such that Ci_i is not 
a favorite of Ci, the unique path in F that connects the head of the edge connecting Cj-i and Cj to Sd- 
Since the number of clusters in the sequence is at most a/37, and the highway in each cluster is a shortest 
path, the total length of the paths in (a) and (b) is at most 2a^/37*. The number of clusters in (c) is at most 
log n, and by the induction hypothesis, the length of each path in (c) is at most 3a^/37*^^. We thus have, 

dF{u,Sc) < 2a2/3y + 2 log na^/Sy-^ 
< Sa^/jy 

for 7 > 2 log n, thus completing the proof of the lemma. □ 

Lemma 8. For any cluster C in Vi, and vertices u, v in C, dpiu, v) is at most 7q!^/37*. 

Proof. By Lemma|7J dp{u, Sc) and dp{v, Sc) are both at most 3a^/37'. By Lemma[5l for any two nodes x 
and y in Sc, dp{x, y) is at most the strong diameter of C, which is at most 07*. Putting these three distances 
together, we obtain that dp{u, v) is at most 7a^/3Y. □ 

Theorem 9. Given an undirected graph G, portal set S = {r}, where r is an arbitrary vertex of G, and 
(a, /3, 'y)-partition T-L ofG as input, Algorithm\l\retums a universal Steiner tree T of stretch 0(q;^/3^7 log^ n) 

Proof. Since S has only one vertex, the forest F returned is a tree. By Lemma [H for any cluster C in any 
partition at level i of H, and any two vertices u and v in C, we have dp{u, v) is at most 7a^/37*. It thus 
follows that F (7a/3) -respects T-L. By LemmaHl we obtain that F has stretch 0(a^/3^7log n). □ 
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3.2 From universal Steiner trees to strong partitions 

We show how to construct partitions with low strong diameter and low cluster-valence for all graphs given 
universal Steiner trees for all graphs. We refer to Appendix lAl for the proof. 

Theorem 10. Given a 5-stretch universal steiner tree algorithm A for all graphs, we can construct a 
polynomial-time (0((5^), O {5),^) -partition algorithm A!, that works for all graphs and all 7 and uses A as 
a black box. 

4 Partition hierarchy for general graphs 

In this section we present our algorithm for obtaining a partition hierarchy for general graphs. As mentioned 
in section |2j we start the hierarchy at level —1 by defining V-i as the trivial partition where every vertex is 
in its own cluster. 

Algorithm. For i = 0, . . . , [log^ DlAM(G) -| ^ build the ith level of the hierarchy, Vi, after building 
the previous levels. Assuming that the level i — 1 partition Vi-i has been constructed, we construct Vi as 
follows. 

Clusters of Vi are formed in successive stages starting from stage 0. We assign a rank to each cluster 
based on the stage in which it is created: a cluster formed in stage ij gets the rank j. (All the clusters of 
level —1 are assigned the rank 0.) We will denote the set of clusters of rank j (of level i) by 5j. A cluster of 
a higher rank is formed by merging clusters of lower ranks. At all times, we maintain a partitioning of the 
graph, i.e., we guarantee that each vertex of the graph is contained in exactly one cluster of level i. 

In stage 0, we simply add all the clusters of Vi-i to S^. For j > 1, stage j works in two phases as 
follows. In the first phase, we repeatedly look for a vertex contained in a cluster of rank at most j — I such 
that the ball of radius 7* around it, i?(i),7*), intersects more than clusters of rank precisely j — I. If 
we find such a vertex v, we merge the cluster containing v with all the clusters of rank j — 1 that 7*) 
intersects. This newly created cluster is assigned the rank j and added to 5* while all the clusters that were 
merged to form it are deleted from their respective Sys. The first phase ends when we can no longer find 
any such vertex v. 

In the second phase, we repeat a simlar procedure for vertices contained in clusters of rank j. As long 
as we can find a vertex in a cluster of rank j such that B{v, 7*) intersects more than nfe clusters of rank 
j — 1, we merge all these clusters of rank j — 1 with the cluster containing v to form a new cluster of rank j. 
We include this new cluster in 5* and delete all the clusters that were merged to form it from their respective 
Sys. The second phase, and also the stage j, ends when we cannot find any such vertex v, and the next 
stage begins. 

If no new cluster gets formed in the first phase of a stage, the construction of level i of the hierarchy 
finishes and Vi is defined as simply the union of all the non empty 5* 's. Remark. Although the two phases 
of a stage are quite similar and one might be tempted to do away with this particular ordering of mergings, 
the naive approach without the ordering does not work. Having a careful order in which mergings are carried 
out enables us to control the growth of the strong diameter of the clusters. To see this, consider a cluster 
formed in the second phase of some stage j. It contains a unique cluster that was formed in the first phase 
of stage j (call it the core). Our ordering ensures that only the vertices in the core can lead to mergings in 
the second phase of stage j. This is because for any vertex v outside the core, , 7*) intersects at most 
nk clusters of rank j — I, otherwise the first phase would not have ended. Thus the mergings of the second 
phase cannot increase the diameter much as the new vertices are always "close" to the core. 

The formal pseudocode of our algorithm and the proof of the following theorem are deferred to Ap- 
pendix |B] Our UST result for general graphs follows as a coroUary of Theorems [TT] and l9l 
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Theorem 11. For any graph G, a hierarchical ((| + e)4^ ^ — |, fcnfc , ^)-partition can beconstructed in 
polynomial time for 7 > ^((l + e)A^~^ — |). In paricular, setting k = \^log n , we obtain a hierarchical 
(20(VT5^) 20(v^)^ 2'^(v^)).^^rt/f/o«/or any graph in polynomial time. 

Corollary 12. A 2^^^^°^^^ -stretch spanning tree can be computed in polynomial time for any undirected 
graph. 

5 The Cluster Aggregation Problem 

In this section, we define tlie Cluster Aggregation problem which arises when building partition hierarchies 
for minor-free graphs (see Section O. Our problem formulation and algorithm, however, apply to arbitrary 
graphs and may be of independent interest. Indeed, our cluster aggregation algorithm is useful for building 
other strong-diameter based hierarchical partitions with applications to distributed computing 1 10 1. 

Definition 13 (Cluster Aggregation). Given a graph G = {V, E), partition V of G, set S (^V 0/ portals, a 
cluster aggregation is a function DEST : V ^ S. The function DEST naturally induces a new partition Q = 
{{Jc DEST{C)=s C \ s £ S} that coarsens V. For each vertex v in V, we define the detour DTRdesj {v)for v 
under DEST to be the difference between the distance from v to S in G and the distance from v to DEST'P(f ) 
in subgraph ofG induced by the cluster in Q that contains v; i.e., DTRDES7(t;) = {dQ^Q^j {v, DEST(C)) — 
d{v, S)). We define the detour of DEST to be max^^v dtRq(i;). The goal of the cluster merging problem is 
to find a cluster aggregation with minimum detour. 

We defer the algorithm and proof of the following theorem to Appendix O 

Theorem 14. The detour for any vertex v in G in the cluster merger returned by Algorithm is at most 
log2(|P|)MAXDlAM(P). 

6 Partition Hierarchy for Minor-free Graphs 

A weighed graph G is ff-minor free if zero or more edge contractions on G does not give a graph isomorphic 
H. Minor-free graphs are special cases of /c-path separable graphs. A graph G is k-path separable [2| if 
there exists a subgraph S, called the k-path separator, such that: (i) S = S*! U 5*2 U • • • U S*;, where for each 
1 < i < I, subgraph Si is the union of ki paths where each path is shortest in G \ IJi<j<i '^j '^i^h respect to 
its end points; (ii) ^ ■ ki < k; (iii) either G \ is empty, or each connected component of G \ 5 is A; -path 
separable and has at most n/2 nodes. 

Thorup on shows that any planar graph G is 3-path separable, where all paths in the separator S belong 
in Si, that is, they are shortest paths in G. Abraham and Gavoille |2| generalize the result to any //-minor 
free graph, for fixed size H, is /c-path separable, for some k = k{H), and the A;-path separator can be 
computed in polynomial time. Interesting classes of i/-minor free graphs are: planar graphs, which exclude 

and i^3,3; outerplanar graphs, which exclude K4 and i^2,3; series-parallel graphs, which exclude K4; 
and trees, which exclude ^3. They also show that the path separator can be computed in polynomial time. 

Algorithm. Consider now an arbitrary weighted //-minor free graph G, for fixed size H. (You may also 
take G to be an arbitrary A; -path separable graph.) We will construct a hierarchical (a, /3, 7)-partition of G 
which is based on forming clusters around the path separators of G. The concept of creating clusters around 
path separators has been introduced by Busch et al. lOTl in the context of sparse covers in /c-path separable 
graphs. Here, we extend that technique to hierarchical partitions. 

We build the hierarchical partition bottom up by coarsening clusters. Suppose we are given a (a, /?, 7*^^ )- 
partition Vi-i. We describe how to build a (a, /3, 7*)-partition Vi, such that Vi-i is a refinement of Vi. 
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The first clusters of partition Vi are formed around a /c-path separator of G by appropriately merging 
clusters of Vi-i close to the separator paths. We then remove the A;-path separator and repeat the clustering 
process recursively to each residual connected component, until no nodes are left in the graph. 

Consider a connected component <I> which appears during the recursive decomposition of G. Let S = 
S*! U ^2 U • • • U 5/ be the path separator of We process the paths of S in sequence starting from the paths 
in 5i, then the paths in S2, and so on. We maintain the new formed clusters in a set M, which is updated 
every time we process a new path. 

Consider now the path p G S^. Let ^' be the connected component of $ \ Ui<j<r which p resides. 
Denote Vf_i C Vi-i the integral clusters of Vi-i which are completely contained in We define the 
following subsets A and B of Vf_i such that: A contains all clusters of Vf_i not yet included in J\f within 
distance 27* from pm'^;B contains all the clusters of A which are adjacent to clusters in M (where M are 
the clusters which have been formed so far form paths processed before p). 

Let be the sub-graph induced by A (note that ^' may not be connected). Combine the clusters in A 
by invoking the cluster aggregation algorithm of Section [S] to each connected component of ^' . We define 
two sets of nodes L and U in which will serve as portals around which new merged clusters will be 
formed. Set L contains the leaders of path p, which is a maximal set of nodes in p n such for any pair 
u,v & L, dp{u, v) > 7% and u and v cannot belong to the same cluster of A. Set U contains one arbitrary 
node from each cluster in B. Combine the clusters in A by invoking the algorithm of Section \5\ to each 
connected component of for the induced clusters from A and the induced portal nodes in LU U. 

Let TZ contain all resulting clusters from invoking Algorithm |4] We can write TZ = IpU JCp where Ip 
consists of clusters that contain a node of L, and ICp consists of clusters that contain a node of U. Each 
cluster X £ ICp\Ip merges further with at most one arbitrary adjacent cluster Y G M, for which there is 
an edge {u, v) G E{'^) such that u £ X, v £ Y , and v ^ We insert the merged cluster from X and Y 
back to M. The returned set of clusters from processing path p is A/" = A/" U Xp. 

The algorithm is initially invoked with $ = G and M = $. The resulting partition Vi is the final M that 
we obtain after we recursively process all the path separators in G. We defer the formal pseudocode of the 
algorithm and its complete analysis to Appendix iDl 

Theorem 15. We can obtain a hierarchical (0(log'^ n), 0(log^ n), 0(log^ n))-partition of minor-free graph 
G in polynomial time. 
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A Proofs for Section |3] 



Algorithm 2 A basic divide-and-conquer algorithm 

Require: Undirected graph G, strong (a, /3, 7)-partition hierarchy {Vi : < i < A;} for G 
Ensure: A spanning tree T of G 

1: for level i from to do 

2: for cluster C in Vi do 

3: For an edge e = (Ci, C2) in G'['Pj_i [C]], let ms{e) denote the edge between Ci to C2 in G that 

has minimum weight. (Recall that Vi-i [G] is the partition Vi-i, restricted to the set C.) 
4: Compute a shortest path tree T' from an arbitrary source vertex in Gs- 
5: Set T{G) to be the union of Uc'(isT{G') and {^^(e) : e G T'}. 
6: end for 
7: end for 

8: Set T to be TiV) (note that V is the lone cluster in Vk)- 



Theorem 16. For any graph G, given an (a, /?, ^)-partition hierarchy, Algorithm^yields an 0((a/3)'°^T "7/?^ log 
stretch universal Steiner tree in polynomial time. 

Proof. Given a graph G, the algorithm builds a spanning tree T{G) by iteratively building spanning trees 
for each cluster in a (a, /3, 7)-partition hierarchy of G, in a bottom-up manner. We first show by induction 
on i that for any cluster C G Vi, the strong diameter of T(C) is at most (a/37)*+^ — 1. The induction basis 
directly follows from the strong diameter property of an (a, /3, l)-partition. 

We now establish the induction step. Let G be a cluster in Vi and let u and v be two vertices in G. By 
the hierarchy property, G is the union of a set, say S, of clusters in Vi-i. Since Vi is an (q, /3, 7*) -partition, 
it follows that the strong diameter of G is at most a^- Hence, there exists a path P between u and v in G 
of length at most a7*. By the intersection property, any ball of radius 7*^^ intersects at most /3 clusters in 
Vi-i, and hence at most /3 clusters in S. Therefore, the path P intersects at most a/37 clusters in S. Thus, 
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the diameter of Gs is at most a/3j. By the induction hypothesis, it follows that the strong diameter of T(C) 
is at most a/37 — 1 + a/57((a/37)' — 1), which equals {a/^^Y^^ — 1. 

Since (a/37)*"'"^/(a7*) is at most a^f^'^'^'^j for k = log^n, it follows that T (a'^~^/3'^"'"^7)-respects 
the strong partition hierarchy. By LemmalU we obtain that T has stretch at most 0((a/3)^°^T^ "7/3^ log^ n), 
completing the proof of the theorem. 

□ 

Proof of Theorem [TOl Assume we have algorithm A that finds a (5-stretch universal Steiner tree for 
all graphs. The algorithm A' works as follows. Given graph G = {V,E,w), it constructs graph G' = 
{V, E\ w') where V = V U {r}, E' = E U {(r, v) : v ^ V} and w' extends w to E' by simply assigning 
w{{r,v)) = 2^7 for all v ^ V. Note that r is an additional vertex not in V. A' provides graph G' and 
root vertex r as input to A. Let T be the tree rooted at r output by A and Ti , . . . , Tjt be the subtrees of T 
connected directly to the root r by single edges. A' simply outputs the partition V = {Ci, . . . , C^}, where 
Gi is the set of vertices in Tj. We now argue that "P is a (0(5^), 0(5), 7)-partition of G. 

Lemma 17. The strong diameter of each Gi is at most A6{6 — 1)7. 

Proof. Fix a Cj. It is enough for us to prove that the height of the tree Tj is at most 25{5 — 1)7 as we can 
reach any vertex in Ci from any other by going through the root of Tj. Assume not. Then there is a vertex v 
in Ti at a distance more than 25{5 — 1)7 from the root of Tj. Consider the graph G' with the root vertex r 
for which A returned T. COST(r{^,}) is more than 26^ + 25{delta - 1)7 = 2(5^7, while OPT({f }) is 2*^7. 

Thus '^Qp-^l^^il > which contradicts the fact that ^ is a (5-stretch universal Steiner tree algorithm. □ 
Lemma 18. For any vertex v € V, B{v,'j) intersects at most 25 clusters ofV. 

Proof. The proof is by contradiction. Suppose there is a vertex v such that B{v,j) intersects d > 26 
clusters of V. We select one vertex from each of these d different clusters such that the selected vertices lie 
in B{v,j), and call this set S. Now consider the graph G' with the root vertex r for which A returned T. 
Since each vertex in S lies in a different Tj in T, COST(rs') is at least 2^7^. On the other hand, Opt(S') is 
at most 2(57 + ^7 = {26 + d)j as u is at a distance 26j from r and each of the d vertices in S are at most a 
distance 7 away from v. Thus Qpj^^^ly^^ ~ Wfd > by our choice of d, which again contradicts the fact 
that ^ is a 5-stretch universal steiner tree algorithm. □ 

The theorem follows directly from the above two lemmas. We note that while our reduction from a 
partition hierarchy to a universal Steiner tree of Section |3T] preserves the input graph instance, the above 
reduction does not. □ 



B Proofs for Section |4] 

We first give the algorithm in pseudocode. 

We have the following claims that bound the size and diameter of the clusters of level i. 

j_ 

Lemma 19. The size (number of vertices) of a cluster of rank j at any level is at least nk. 

Proof. We prove the claim using induction on j. For j = 0, the claim follows trivially as each cluster of 

any rank has size at least 1. For the induction step, observe that a cluster of rank j contains more than 

j-i 

clusters of rank j — 1 which all have size at least n fc by the induction hypothesis. □ 
Corollary 20. The rank of any cluster of any level can be at most k — 1. 
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Algorithm 3 Algorithm to obtain a partition hierarchy for general graphs 
Require: A weighted graph G = (V, w), integer /c, 7 > + e)4^^^ — |) 
Ensure: A hierarchical (q = (| + e)^'^^^ — |, /3 = kn^ , 7)-partition of G 
1: Define V-i to be the trivial partition where each vertex of V is in its own cluster, 
i.e., P_i = {{v} -.veV}. 

2: for level i from to [log^(^^^^^^)] do 

4: S] = for alll < j < A; - 1. 
5: j ^ 1. 

6: while j < A; and 7^ do 

7: while there exists a -y such that v e Cy for some G S'j^ and j„ < j, 

and B{v, 7*) intersects more than nk clusters from S'^_i do 
8: Delete C„ from 5j , i.e., S] = S] \ {GJ. 

9: Delete all the clusters of Sj_i that B{v, 7*) intersects from it, i.e., 

S]_^ = S]_^ \{G:Ge S]_^ A B{v, f)nG^ 0}. 
10: Merge and all the clusters deleted from Sj_^ and add to S^, i.e., 

sj = S) u {u{c : (c = c„) V (c G A f ) n C 7^ 0)}}. 

11: end while 

12: while there exists a v such that v € for some Gy G 5^, 

and B{v, 7*) intersects more than nk clusters from S'j_]^ do 
13: Delete C„ from S*], i.e., S] = S] \ {Gy}. 

14: Delete all the clusters of that B(v^ 7') intersects from it, i.e., 

= \ {C : C G A 5(t;,y) n C 7^ 0}. 
15: Merge C„ and all the clusters deleted from 5'*_^ and add to S**, i.e., 

s-;. = S] u {u{c : (c = c„) V (c G A B{v, f ) n C / 0)}}. 
16: end while 
17: 3=3 + 1. 
18: end while 

19: V, = Ulzt'Sl 
20: end for 

21: Output (Po,-- •,7'^, ,D1AM(G).J- 



Proof. From the previous lemma it follows that at any level there can be at most = n clusters of 

n * 

rank k — 1 which immediately implies that no cluster of rank k gets formed. □ 

Lemma 21. The strong diameter of every cluster of level i and rank j is a? moj? 7*((| + e)4-^ ~ |)' Provided 

7>i((| + ^)4'-'-|)- 

Proof. We prove the claim by induction on i and j. The case for i = —1 is trivially true. For the case of 
i > 0, assume the claim to be true for every cluster of level Since a cluster of level i and rank is simply 
one of these clusters, its diameter is bounded by 7*~^ ((| + e)4^~^ — |) by the induction hypothesis and the 
above corollary. This is at most 7'((| + e)4° — |) = 7*e by our assumption that 7 > ^((l + 6)4*^"^ — |) 
which proves the claim for level i and rank 0. 

Now assume that the claim is true for level i and all rank at most j — 1, and consider a cluster G at level 
i and rank j. There are two cases to consider depending upon whether G was formed in the first or second 
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phase of stage j. 

If C was formed in the first phase, it is the union of the cluster containing v and all the clusters of rank 
j — I that the ball B{v,j'^) intersects, where v is contained in a cluster of rank at most j — 1. By the 
induction hypothesis, the strong diameters of all these clusters which were merged to form C are bounded 
by 7*((| + e)4-'~^ — |). This implies that any vertex in C is at most a distance 7*((| + e)4-'~^ ~ |) + 7* 
from V. Thus the strong diameter of C is at most 27' ((| + e)4-'~^ — | + 1) < 7*((| + — |) as j > 1. 

If C was formed in the second phase, it implies that there was a cluster C of rank j which was formed in 
the first phase of stage j and got merged with other clusters to form C in the second phase. By the argument 
above, the strong diameter of C was at most 27*((| + e)4^~^ ~ |) + -'-)• Furthermore, we know that any 
vertex in C either comes from C or from some cluster of rank j — 1 which intersects the ball B{v,"f^) for 
a vertex v contained in C. From the above facts and the induction hypothesis, we conclude that the strong 
diameterof Cisboundedby 27*((|+e)4J-i-|) + l)+27^+27*((|+e)4J-i-|) = V((|+e)4J-|). □ 

Now we are in a position to prove Theorem [TT] which gives the partition hierarchy for general graphs. 
Proof of Theorem [iTJ The bound on cluster diameter is given by Lemma [211 For the intersection bound, 
observe that for any level i of the hierarchy and any vertex v, the ball i?(f,7*) can intersect at most nk 
clusters of a given rank. This implies that B{v,^^) can intersect at most fen* clusters in total from level i as 
every cluster has rank between and k — I. □ 

Remark The restriction on 7 can be removed if one wants just one level of the hierarchy. 

C Proofs for Section |5] 

Our algorithm for the Cluster Aggregation problem proceeds in O(logn) phases. Each phase has a number 
of iterations. Each iteration aggregates a subset of the clusters in V and assigns the same DEST value for 
each of them. The selection of clusters in a particular iteration is based on how shortest paths from these 
clusters to S proceed through the graph. The interaction of these shortest paths is captured by means of 
auxiliary directed graph. For any directed graph K and set A of vertices in K, let in/^(A) (resp., outx(^)) 
denote the set of vertices that have an edge into (resp., from) any vertex in A. 

Lemma 22. IfVi and V^+i are the set of vertices in Di and -Dj+i at the start of phase i andi+1, respectively, 
then \Vi+i\ < \Vi\/2. 

Proof. We first note that V^+i C Vi. Furthermore, in each iteration of the ith phase, when we add inj^(T) 
to Vi+i, |in^(T)| is less than |T|, where T is a subset of Vi — Fj+i. Thus, \Vi\ — \Vi+i\ > yielding 
the desired claim. □ 

For each in S, let C(rj) denote the union of the clusters X such that DESt(X) = r^. Note that C(rj) 
may vary as the algorithm progresses. 

Proof of Theorem [l4t Let m equal \ V\, the number of clusters in V. Fix a portal r in S. We will show that 
at the end of iteration j of phase i, the following holds: 

• For any Z in V, if DESt(Z) equals r, then for each vertex v in Z, there is a path in G[C (r)] from v to 
DESt(Z) of weight at most 2{{i - 1) log(|P|) + j)MAXDlAM(P) more than d{Z, S). 

Before we establish the above claim, we show how the statement of the theorem follows. By Lemma|22j the 
number of phases is at most log m. Furthermore, the number of iterations in each phase is at most log m. 
Therefore, at termination, the detour for each cluster in V is at most 2(log^ m)MAXDlAM('P), yielding the 
desired claim. 
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Algorithm 4 The Cluster Aggregation algorithm 
Require: An undirected graph G, partition V, set S of portals. 
Ensure: A cluster aggregation dest 
1: For each set X in V, let px denote a shortest path from X to S, and let Px denote the sequence of 
clusters visited in px- 

2: For a cluster Y that appears in Px , define the position of a cluster Y in Px to be i if the number of 

distinct clusters that Px visits before first visiting y is ^ — 1. 
3: Construct an auxiliary directed graph D whose vertices are the clusters of V. For vertices X and Y, D 

has an edge from X to y if Px contains Y; furthermore, we label the edge {X, Y) with the position of 

YinPx. 

4: Set i to be and Vq to be the set of vertices in D. 
5: repeat 

6: Let Di denote the subgraph of D induced by V^. Let Ei denote the set of edges in Dj. Set V^+i to 

and D to Di. 
1: repeat {{Begin Phase i}} 
8: Let v be an arbitrary vertex in D. 
9: if i = then 

10: Set dest('u) to be the vertex in S nearest to v; 

11: else 

12: Set DEST(f ) to be dest(x) where x is a vertex in Vi-i — Vi and the label of {v, x) is the least 

among all edges from v to V^-i — V^. 
13: end if 

14: Let T denote {v] U out^({f }). 
15: repeat 

16: For each u \a D — T, and each edge {u, w) in D, remove (u, w) from D if there exists an edge 

(u, x) in D with x £ T such that the label of {u, x) is smaller than the label of (u, w). 
17: Set T equal to T U mQ{T) U outg(r U inQ{T)). 

18: until |in5(r)| < |T|. 

19: For each u in T, set DEST('u) to be equal to DEST(ti). Set V^+i to V^+i U ing(r) and remove 
ruing(r) from 5. 

20: until D is empty 

21: Increment i 

22: until Vi is {End Phase i} 



Consider an iteration j of phase i. In the following, T and D refer to the variables in the above algorithm. 
The set of clusters for which we set the dest values is given by mg{T) U outg{T U mg{T)). Every cluster 
in T shares the same DEST value, say x. By the induction hypothesis, at the start of iteration j of phase ?', 
each cluster Y in the set of clusters with DEST equal to x has a path qy in G[C{x)] from y to x of weight 
at most 2{{i - 1) log m + {j - 1))MaxDiam('P) more than d(Y, S). 

Consider a vertex Z in ing(r). Since Z is in inj5(T), its path pz contains a cluster Y in T. Let p' 
denote the prefix of the path pz that connects Z to the first occurrence of Y in pz', and let p" denote the 
remainder of the path pz- We note that every cluster that appears in p' is in out^({z}), and is, hence, also 
in out^(r U in^(T)). Thus, at the end of iteration j, p' is fully contained in G[C(x)] the subgraph of G 
induced by the set of vertices with dest equal to x. The weight of pz equals the sum of the weights of p' 
and p". The weight of py is at most the weight of p". Thus, the path from Z to x consisting of p', followed 
by a shortest path to py in Y, and followed by the path qy is entirely contained in G[C{x)] and has weight 
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at most 2{{i — 1) log m + j)MAxDlAM('P) more than the length of pz- (This is because the weight of any 
shortest path in Y is at most MaxDiam('P).) This completes the induction step of the proof. □ 

D Proofs for Section [6] 



Algorithm 5 Component clustering in minor-free graph 

Require: Connected component <1> of minor-free graph G, strong (a, ^, 7*~^)-partition Vi-i of G, set A/" 

with coarsen clusters of Vi-i. 
Ensure: Coarsening the Vi-i clusters in which are then inserted in J\f. 

1: Let 5 = Si U 5*2 U • • • U 5/ be a /c-path separator of <I>. 

2: for X from 1 to / do 

3: for each path p £ S^do 

4: Let ^' be the connected component of $ \ IJi<j<r "^x which p resides. 

5: Let VfLi = {X G Vi-i ■ X C be the integral clusters of Vi-i which are completely 

contained within 

6: Let A = {X G Vf_i : {d^{X,p) < 2f ) A (X n V{Af) = 0)} be the all integral clusters of ^' 
which have not yet been coarsen (do not belong in M) and are within distance 27* from pin"^. 

7: LetB = {X e A: 3{u, v) € E{^),u e X Av e V{Af) U (F(^) \ V{Vf_i)}, contains all the 
clusters of A which are adjacent to clusters in M or adjacent to non-integral clusters in 4'. 

8: Let ^i' = \i) n V{A) be the sub-graph of ^ induced by V{A) (note that ^' may not be connected). 

9: Let L be the leaders of path p, which is a maximal set of nodes in pCi such for any pair u,v ^ L, 
dp{u, v) > 7*, and u and v cannot belong to the same cluster of A. 

10: Let U be the set that contains one arbitrary node from each cluster in B. 

11: Combine the clusters in A by invoking Algorithm H] to each connected component of for the 

induced clusters from A and the induced portal nodes in LU U. 
12: Let TZ be the union of the resulting set of clusters from Algorithm HI 

13: Write TZ = ZpU JCp where Zp consists of clusters that contain a node of L, and ICp consists of 

clusters that contain a node of U. 
14: for each cluster X £ }Cp\Ip do 

15: X merges with at most one arbitrary adjacent cluster Y ^ M such that there is an edge (n, v) G 

Ei"^), u € X,v £Y,Sindv ^ 
16: We insert the merged cluster from X and Y back to M. 

17: end for 
18: Update TV = A/" U Xp. 
19: end for 
20: end for 

21: for each connected component T G ^ \ S do 

22: Invoke (recursively) Algorithm [5] with parameters T, Vi-i, and M. 
23: Update M to be the result of the recursive invocation. 
24: end for 
25: Return TV. 



D.l Analysis 

Consider a minor-free graph G with n nodes. The recursive process of removing path separators defines a 
decomposition tree T of G. Each node t e T corresponds to a connected component of G, which we will 
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denote G{t). The root vr of T corresponds to G, namely, G(7r) = G. Denote S{t) the path separator for the 
respective graph G{t). If G{t) \ S{t) = 0, then t is a leaf of T. Otherwise, for each connected component 
^ G G{t) \ S{t) there is a node w eT such that if is a child of t and G{w) = <I>. 

Consider a node t ^ T. Each path p E S{t) has a respective processing order in S{t), denoted order{p), 
which is an integer between 1 and k. The set of previous paths of p, denoted Q{p), is defined to include 
those paths in S{t) which have smaller order, or the paths in the ancestors of t: 

Q{p) = G S{t) : order{q) < order{p)} U {q € S{w) : w is ancestor of t}. 

According to the algorithm, after a new cluster is created (when a path is processed) it may get larger 
when new clusters merge into it (when subsequent paths are processed). Once a cluster is created it can 
never shrink or be removed. Consider a path p G S{t), for some t ^ T. We say that a cluster belongs to 
p if it contains a leader of p. It is easy to verify that a cluster in Vi does not belong to more than one path 
(we will actually show in Lemma [28] that each cluster in Vi belongs to exactly one path). Let Zp denote the 
clusters that belong to p immediately after p is processed. Let Ip denote the final clusters of pinVi. 

In the analysis below, assume that 7 > a, and define a' = ciXk log n, for a constant ci. The parameter 
A denotes the impact of the detour of Algorithm IH on the ratio of the cluster diameter before and after the 
merger merger. From Theorem[l4l A = 0(log^ n). 

Lemma 23. In ^' every cluster of A is within distance at most 87* to a node in LUU. 

Proof. Consider a cluster X ^ A. Let u £ X be the closest to a node v £ pin graph From definition of 
A, dq,{u, v) < 27*. Let (jf be a shortest path in connecting u to v. 

If q uses a cluster outside A, then that cluster must be either a cluster in A/" or a non-integral cluster 
of Vi-i. Therefore, q has to cross a cluster in B. Let i G V{B) n U. Since af^ < -/\ d^>{u,£) < 
27* + Q7*-i < 37*. 

Consider now the case where q uses only clusters in A. Let p' be the subpath of p which consists of the 
nodes within distance 7* from u, with respect to 

Suppose that p' uses only clusters in A. Suppose, for the sake of contradiction, that none of the nodes 
in p' is a leader in L. Let y G ^ be the cluster that contains v. We have that the closest leader to u (if it 
exists), must be at distance greater than 7* from v. Since the diameter of Y is at most 07*"^ < 7*, then 
L is not maximal because w is a valid possible leader. Therefore, p' must contain a leader £ £ L. Thus, 

d,s,>{u,i) < 27' + y < 3y. 

If p' doesn't use a cluster in A, then it has to use a cluster in B. By selecting a node £ G V{B) n U, we 
get d*/ {u, I) < 27* + af'^ < 37*. □ 

Lemma 24. Every cluster ofXp has diameter at most a'7*. 

Proof. From Lemma |23l each cluster in A is within distance a = 87* from a node in L U f/. Since, the 
diameter of each cluster in A is bounded by 07*"^ < 7* < c. Algorithmic produces new clusters around 
the nodes in L U C/, so that each new cluster has diameter at most Act < 3A7*. Thus the cluster in Zp have 
diameter most Q = 3A7*. Similarly, the clusters in Kp have also diameter at most Q. 

The clusters in T{p) may increase in diameter, when they merge with Kq clusters from some path q 
processed after p. This path q may belong to S{t), or it may belong to S{w), where u; is a descendant in the 
sub-tree T' rooted in t. 

Each path q G S{t) with order after p, increases the diameter of Xp by at most 2C,, since newly merged 
clusters from Kq add at most one layer of clusters into Xp, and any two clusters in the layer can reach each 
other through the previous instance of Xp. Thus, when we process the last path in S{t), we have added at 
most k layers, and the increase in the diameter of the new Xp will be at most 2C,k. 
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Similarly, any node in the sub-tree T' , contributes at most k new layers to Xp. However, all the nodes of 
T' in the same level contribute in total k layers, since clusters in them are formed independent of each other. 
Since the sub-tree T has at most 1 + log n levels (including t), we have in total k{l + log n) additional layers 
in Xp, contributing increase at most 2C,k{l + log n) to the diameter of X{p). Therefore, the diameter of X{p) 
is at most 2C,k{\ + log n) + C, < ciXk^'' log n, for some constant ci. □ 

For a path q G Q{p), let X'^ be the clusters of g just before processing path p, and X'^ be the clusters of 
a path q just after processing path p. Let Z'{p) = \Jq^Q{p) and Z"{p) = [jqeQ(p) Define Z{p) = 
Xp U Z"{p). For any set of nodes Y let T{Y) denote the set of clusters in Vi-i which are within distance 
2f from p, namely, T{Y) = {X G Vi-i : daiX, Y) < 2f}. 

Lemma 25. r(p) C Z{p). 

Proof. We prove the claim by induction on For the basis case, = 0, path p is the first to be 

processed by the algorithm with Q{p) = 0. Therefore, r(p) = A{p) =Xp = Z(p). 

Assume now that the claim holds for \Q{p)\ < a. Consider now the case = a + 1. From 

induction hypothesis, for each path q G Q{p), T{q) C Z{q). Let J\f be the new formed clusters of the 
algorithm just before we process p. Since Z{q) C M, we have that T(q) C J\f. 

First, we show that just before we process path p the cluster of that intersect ^ can only be those in 
Z'{p)nAf. Suppose, for the sake of contradiction, that there is a cluster X e M\Z'{p) which intersects 
Cluster X must contain a node y ^ V{"^), since any integral cluster in can only have been built by a path 
in Q{p) n Sy-, where p G S^-. Take a node u G X n V{"^). Any path from utoy must cross one of the paths 
in Q{p) whose removal from G contributed to the formation of 'I'. However, from induction hypothesis all 
the nodes in the paths in Q{p) belong to clusters in Z'{p). Consequently, y cannot exist, and hence neither 
does X. 

Next, we show that any non-integral cluster Y G Vi-i, Y ^ "Pj* i, which intersects ^' is used in a cluster 
of Z'{p). Note that Y must have be crossed by at least a path q € Qip) whose removal from G contributed 
to the creation of ^. Since the diameter of Y is bounded by 07*"^ < 7*, we have that Y G r{q) C Z{q). 
Therefore, Y G Z'{p). 

We continue now with the main claim. Let X G r(p). There are the following possibiUties: 
i'. we examine the following sub-cases. 

- X G M: Since before processing p only clusters in Z'{p) intersect ^, X G Z'{p). Therefore, 
after processing p, X will remain in the same cluster as in Z'ip). Thus, X G Z{p). 

- X ^ A: from the algorithm, after processing p there are two possibilities. First possibiUty is 
X G Ip and hence, X G Z{p). Second possibiUty is X G /Cp \ Ip and X is either (i) adjacent to 
some node in A^, or (ii) adjacent to some non integral cluster in In case (i) X merges with a 
cluster in N, and since only clusters of Z\p) can be in ^, we immediately have X G Z{p). In 
case (ii), as we have shown above any non-integral cluster is a member of Z' (p) C M, and thus 
X merges with a cluster of Z'{p), which impUes that X ^ Z{p). 

• X ^ Vf_i. Then, X must contain a node u ^ ^. If X intersects then we have shown above that 
X G Z'{p), and thus X G Z{p). If X does not intersect any path from p to X must intersect a 
path q G Q{p), since otherwise X wouldn't reside in a different component. Since dciPjX) < 27*, 
we have that dGiq,X) < 2^. Therefore, X G r{q) C Z{q). Consequently, X G Z{p). 

□ 

Lemma 26. Any ball of radius 7* in G intersects with at most 2a' + 3 clusters ofXp. 
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Proof. We start by showing that we only need to consider balls of radius 7* in Let G' = G\'^ . Let Y 
denote the set of nodes in G' such that each x G F is adjacent to a node in ^. It must be that Y C V{Q{p)), 
where V{Q{p)) denotes the nodes of all the paths in Q{p). Let F be all the clusters in 'Pf_i which are at 
distance at most 27* from Y, namely, F = {X £ Vf_^ : dm{X, Y) < 2j'}. Clearly, F = Vf_^ n r(y). 

From Lemma [25l each cluster in T{Y) has been used in the clusters of some path of Q{p) that goes 
through Y . Therefore, the clusters in F are all used in clusters of paths in Q{p). Consequently, the clusters 
of p, Zp, cannot possibly belong in F, namely Tp r\ F = $. When we further process paths in ^ in node 
t (paths ordered after p in S{t)), and then descendants of p, we have that each of the clusters in Ip grows, 
however they wiU never intersect F. Thus, Zpr\ F = %. 

Consequently, any cluster of Zp is at distance at least 27* from G' . Therefore, any ball of radius 7* that 
intersects clusters of Zp has to be a sub-graph of ^. Thus, in order to prove the main claim, we only need to 
focus on graph ^. 

Consider a ball B = B{u,Y) within Suppose that ^ > 2 clusters of Zp intersect p. Path p is a 
shortest path in Each cluster in Zp has a distinct leader in p. The leaders are at distance at least 7* apart 
in p. Therefore, there are two clusters intersecting B, whose respective leaders, £1 and £2, are at distance at 
least d^{£i,£2) > — 1)7*- Ball B provides an alternative path between £1 and £2 through u, with total 
length is bounded by (i<i-(^i,^2) < (i^f(£i,n) + d^{u,£2). Since the cluster of £1 intersects B, we obtain 
from LemmaElthat d^{£i,u) < Q'7* + 7* = (q' + 1)7*. Similarly, d<i,{u,£2) < {a' + 1)7*. Therefore, 
d^{£ij2) < 2(a' + 1)7*. Therefore, it has to be ^ - 1 < 2{a' + 1), or equivalently, ^ < 2a' + 3. □ 

Lemma 27. Any ball of radius 7* in G intersects with at most C20tk log n clusters ofVi, for a constant C2. 

Proof. Consider a node v £ G and the ball B = B{v, 7' ). Each node v £ G belongs to a path p G S{w), 
of some path separator S{'w), w £ T,m the recursive decomposition of G. Clearly, B{v, 7*) C r{p). From 
Lemma [25l we have that B C Z{p). Since Z{p) consists only of clusters that belong to Q' = p U Q{p). 
All the paths in Q' appear in path separators of T between the root and w. Since the depth of T is at most 
1 + log n, the total number of path separators involved in Q' is at most 1 + log n, each contributing k paths. 
Therefore, \Q'\ < k{l -\- logn). 

From Lemma |26l B intersects with at most (2a' + 3) clusters of each path q £ Q'. Thus, the total 
number of clusters of Vi intersecting B is at most (2a' + 3)A;(1 + logn) < C20/ klogn, for a constant C2, 
as needed. □ 

Lemma 28. Vi is a (a' ,C2a'k log n, 'j^) -partition. 

Proof. Every node in G belongs to a path in some path separator used by the algorithm. From Lemma l25l 
each node in a path p must be a member of some cluster which either belongs to p or to a path q G Q{p). 
Consequently, each node v £ G will appear in some cluster of Zg of some path q. Therefore, Vi is a partition 
of G. 

From Lemmas |24j the diameter of any Zg is bounded by a'7*. Therefore, the diameter of each cluster in 
Vi is at most a'7*. From Lemma|27j each ball of radius 7* intersects at most C2a'A; log n clusters of Vi. □ 

Proof of Theorem [TSt From Lemma [28l since k = 0(1), we can build a hierarchy of clusters by choosing 
a = a' = 0(log^ n). Further, for each level i, we can create the necessary padding around a root node 
r G G of radius 7*, by creating a cluster that contains the ball B{r, 7*). We can do this by using either of two 
methods. In the first method, we can explicitly add r to the first separator in G, as an artificial path (with one 
node) that needs to be processed first. This causes the size of the first separator to be of size A; + 1, and in the 
analysis we replace k with k + 1. In the second method, we can merge all the clusters in B{r, 7*) created 
by the algorithm, giving a new cluster whose diameter is no more than three times the diameter of the old 
cluster. Either way, the impact to the parameters of the clustering is a constant factor, giving the desired 
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hierarchical partition. It is easy to verify that all the steps of the algorithm can be performed in polynomial 
time with respect to the size of G and the parameters of the problem. □ 
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